Optical Character Recognition (OCR) for Printed Devnagari Script Using Artificial Neural Network

نویسندگان

  • Raghuraj Singh
  • C. S. Yadav
  • Prabhat Verma
  • Vibhash Yadav
چکیده

There are about 300 million people in India who speak Hindi and write Devnagari script. Research in Optical Character Recognition (OCR) is popular for its application potential in banks, post offices, defense organizations and library automation etc. However most of the OCR systems are available for European texts. In this paper, we have proposed a technique for OCR System for different five fonts and sizes of printed Devnagari script using Artificial Neural Network. The recognition rate of the proposed OCR system with the image document of Devnagari Script has been found to be quite high.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Java Based Devnagari Script Recognition using JNI ( Java Native Interface )

Optical Character Recognition (OCR) is a form of computer vision that extracts alphanumeric characters from a digital image. The technology can be used for digitizing printed text, handwriting recognition, and making digital images searchable for text. Following the paper, an implementation of OCR Recognition for Devnagari script in JAVA will be presented and analysed. Keywords— JNI, CLP, XCLP.

متن کامل

A Modfied Self-organizing Map Neural Network to Recognize Multi-font Printed Persian Numerals (RESEARCH NOTE)

This paper proposes a new method to distinguish the printed digits, regardless of font and size, using neural networks.Unlike our proposed method, existing neural network based techniques are only able to recognize the trained fonts. These methods need a large database containing digits in various fonts. New fonts are often introduced to the public, which may not be truly recognized by the Opti...

متن کامل

Multi-font Optical Character Recognition System for Printed Telugu Text

The Telugu OCR systems available in the market currently recognize only the specific fonts of Telugu. This paper describes the development of a multi-font OCR system for printed Telugu characters using Artificial Neural Networks. In this system classification of the characters is carried out using multi layer neural network Architecture.

متن کامل

A Comparative Analysis of Classifiers Accuracies for Bilingual Printed Documents (Oriya-English)

Bilingual document recognition has been the subject of intensive research and our focus is on the recognition of an Oriya-English bilingual documents. In most of our official papers, school text books, it is observed that English words interspersed within the Indian languages. So there is need for an Optical Character Recognition (OCR) system which can recognize these bilingual documents and st...

متن کامل

On the Performance of Devnagari Handwritten Character Recognition

This paper presents the offline handwritten character recognition for Devnagari, a major script of India. The main objective of this work is to develop a handwritten dataset (CPAR-2012) for Devnagari character and further develop a character recognition scheme for benchmark study. The present dataset is a new development in Devnagari optical document recognition. The dataset includes 78,400 sam...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010